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^^' Abstract. We consider a dynamic mean-field ferromagnetic model in 
C^ I the low-temperature regime in the neighborhood of the zero magnetiza- 
tion state. We study the random time it takes for the system to make a 
decision, i.e., to exit the neighborhood of the unstable equilibrium and 
V^ , approach one of the two stable equilibrium points. We prove a limit 
OA ' theorem for the distribution of this random time in the thermodynamic 
limit. 

CIh 

^ ■ 1. Introduction 

-)— > 

The asymptotic properties of exit from a small neighborhood of an unsta- 
ble equilibrium of a dynamical system under small white noise perturbation 
were first studied rigorously in [7] . It was shown that as the noise intensity e 
^ goes to zero, the exit time r^ behaves roughly as a~^ lne~^, where a is the 

_!. ■ local expansion rate (Lyapunov exponent) at the equilibrium point. In [2] 

\,Q I and [3] it was shown that r^ — a^^ Ine^^ converges to a limiting distribution 

0^ ■ that is a dilation and translation of In |G[^^, where G is a standard Gaussian 

random variable. 



in 

o 
o 



Jjf^ I Understanding distributional asymptotics for the exit time was pivotal in 

describing the vanishing noise asymptotics for noisy heteroclinic networks, 



^ 



see [3] and [1]. These systems occur naturally in the context of neural dy- 
namics and sequential decision making, see, e.g., j9] and references therein. 
Exit times for diffusion models have been used in psychology to describe 
reaction times in decision tasks, see [10] and references therein, and it is 
C^ ' natural to ask if the limiting behavior of exit times described above is re- 

produced in statistical mechanics models of neural computation. 

In this paper we study one of the the simplest possible models of this kind, 
the dynamic mean field ferromagnetic model, also known as the Curie- Weiss 
model, in the low temperature regime with two minima of free energy. We 
start the evolution of the system at the completely disordered state with zero 
magnetization, where the numbers of plus spins equals the number of minus 
spins. We stop the dynamics as soon as magnetization enters a neighborhood 
of one of the stable equilibrium values and interpret that event as a decision 
made by the system between the two choices. We show that as the number A^ 
of spin variables (representing individual neurons in the neural computation 
context) goes to infinity, the exit time behaves as InA^ and the correction 
to the main term converges to an affine transformation of ln|G|~^, thus 
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reproducing the above result for the diffusion in the neighborhood of an 
unstable equilibrium. 

2. The model and the main result 

First let us recall the mean-field ferromagnetic equilibrium Ising model 
also known as Curie- Weiss model, see [5, Section IV.4]. Let us fix a large 
number A^ and consider N spin variables. Each variable X/., k = 1, . . . ,N 
takes values ±1, and the energy assigned to a configuration {xk)^^i is given 
by 

1 ^ 

We then can fix an inverse temperature value /? > and consider the 
Boltzmann-Gibbs distribution defined by E and /?: 

e-/3S(a;) 

P7v{^fc = a;fc, fc = l,...iV} = ^ , 

where 

is the partition function. 

Since there is no geometry involved in this mean-field model and the 
strength of interactions between two spins is the same for all pairs of spins, 
one can describe the macroscopic behavior of the system by a single variable 
called magnetization, 



Notice that 



^(^) = ^E^* e[-i,i]. 

E(x) = -NM^{x)/2 



^f-(f)' 



Therefore, 

I ^ ^ N\ ZN\{N + n)/2) 

if {N + n)/2 is integer. 

Recall that (see, e.g., O Lemma L3.2]) uniformly in fc = 0, 



, n, 



A^ \k j \N } \ N 

where 

h{x) = — xlnx — (1 — x) ln(l — x), x G [0, 1], 

is the entropy of the Bernoulli distribution with probabilities x and 1 — x. 
Therefore, 

n ^ 1 



P^[M{X) = ^] = ^e 



NF{n/N)+0{lnN) 
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where the free energy per spin F is defined by 

/3 2 T /^l "i 

Fim) = m — h \ — | 

^ ' 2 V2 2 

It is easy to show that the sequence of distributions Pn{M[X) € •} satisfies 
a large deviation principle on [0, 1] with rate function J given by 

Jim) = Fim) — imaF. 

[0,1] 

Differentiating F, we see that the minimizers of F satisfy 

1 , 1 + "i 

2.1 /3m=-ln- , 

2 1 — m 

and, as elementary analysis shows, (i) for /3 < 1, a unique minimizer of F is 
m = (corresponding to completely disordered case), and F"(0) > so that 
F is approximately quadratic in the neighborhood of the minimizer; (ii) for 
/? > 1, there are two minimizers m = ±771* , for some m* > 0; (iii) if /3 = 1 
then is still a unique minimizer, but contrary to the first case, -F"(0) = 0, 
and the leading term in the Taylor expansion of F at is order 4. 

In this paper we are concerned with the low-temperature case (ii). In that 
situation, point is also a solution of (|2.ip . but it is an unstable equilibrium 
of the system being the local maximum of the free energy F. We are going to 
consider stochastic dynamics compatible with Curie- Weiss model and study 
it in the neighborhood of the unstable equilibrium in the case j3 > 1. 

We must study a {— 1, +l}^-valued Markov process with intensities of 
spin flips Ci{x),i G {1, . . . , N}, x G { — 1, +1}^ defined by 

P{Xi{t + At) ^ Xi(t)|X(t) =x} = Ci{x)/\t + o(At), At I 0. 

If we want the process X to be reversible w.r.t. the Gibbs distribution P/v, 
we have to require that 

Ci(x)exp{-/3£^(x)} 

does not depend on Xj, see [8l Section IV.2]. Equivalently, we can require 
that 

Ci(x)exp{/3xiM(x)} 

does not depend on Xj. There are many choices for rates q, and there is no 
physical reason to prefer one of them to others. In this paper we will work 
with 

(2.2) Ci(a;) = exp{-/3a;iM(x)}, 

although our results should hold for a variety of other choices of Cj(x). 

Notice that if the spin Xj is aligned with magnetization M(x), then the 
resulting hipping rate of i-th spin is lower than in the opposite situation 
where Xj is misaligned with M(x). This is the result of the ferromagnetic 
nature of the model which favors configurations with most spins aligned 
with each other. 
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Suppose now that we observe only the magnetization, or, equivalently, 
the number of +l-spins. Fhpping a —1 spin means then a transition from 
the current magnetization m to m + 2/N. Since the number of —1 spins 
equals A^(l — m)/2, we see that the total transition rate m i— > m + 2/N is 
given by X+{m,N), where 

A+(m, N) = N — - — exp{/3m}. 

Flipping a +1 spin means a transition from the current magnetization m to 
m — 2/N. Since the number of +1 spins equals A^(l + m)/2, we see that the 
total transition rate m >-^ m — 2/N is given by A_(m, N), where 

X^{m,N) =N exp{-/3m}. 

Let us consider the Markov process Mjv describing the evolution of mag- 
netization in the above model and set Mi\f{0) = (this means that N has 
to be even, but this is not a really important restriction). 

It is clear that M^- will spend some time in the neighborhood of and 
then it will escape that neighborhood and head towards one of the minims of 
free energy, itm* . We can interpret the exit in each of these directions as the 
decision made by the system. We set a threshold level R G (0,m*) and as 
soon as Mi\f exceeds R in absolute value, we claim that the system has made 
the decision. The choice of one of the two alternatives is encoded by the 
sign of Mjsf at that time. Our main result describes the asymptotics of the 
random time it takes to reach the threshold R starting from the completely 
disordered state with zero magnetization. According to the interpretation 
above, this time can be viewed as the decision making time for the situation 
where the initial state is a completely unbiased indecisive state. 

More formally, for any R € {0,m^,) we introduce 

tn{R) =inf{t: |M^(t)| > R}. 

Our main result describes the joint asymptotic behavior of random vari- 
ables tn{R) and sgn Mj^{tn). To state it, we need more notation. For 
m € [—1, 1], we denote 

(2.3) b{m) = ^(A+(m, N) - A_(m, N)) = (1 - m)e^"' - (1 + m)e"'^"', 
introduce a = b'{0) = 2/3 — 2 > and Q{x) = b{x) — ax, x € M, and define 

DiR) = KiR) + ^^ + '^^^^, Rem, 
a 2a 

where 

/■^ 0(x) 

(2.4) K(R) = - / :/, dx G M, ReR. 

Jq axh(x) 
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Theorem 2.1. For any R G (0,m*), as N ^ oo 

sgnMN{TN{R)), TNiR) - ^ InA^) "^^ (sgnG, -i In |G| + D{R)\ , 
where G is a standard Gaussian random variable. 

3. Proof 

The proof is based on the theory of Markov processes, martingales and 
their convergence. We refer to |6j as an exceUent source on the relevant 
background. 

The form of rates A-(-, A_ implies that the generator Ljv of the magneti- 
zation process Mj^ is given by 

LNfim) = ^ {l-ni)e^"^(^f{m+^)-f{m) 

+(l + m)e-^-(^/(m-|)-/(m; 

In particular, 

f{MN{t)) - [ LNf{MN{s))ds 
Jo 

has to be a martingale for any /, see Proposition 1.7 in [6l Chapter 4]. 

Choosing /(x) = x on [— 1, 1], we obtain that 

Ziv(t) = MN{t) - / b{MN{s))ds 
Jo 

is a bounded variation cadlag martingale, where 6(-) is defined in (j2.3p . 
so that it plays the role of drift coefficient that drives the deterministic 
component of the process. Notice that zeros of b{m) coincide with solutions 
of equation (j2.ip . so that for any point x G (0, tti*), lim(_s.4.oo S^x = m* and 
limt^-oo S^x = 0, and for any point x G (— ?7i*,0), lim(_^+oo "^'x = — m* 
and lim(^_oo "^'x = 0, where 5 is the flow generated by b: S^x = x and 
|5*x = 6(5*x). 

Using the representation 

(3.1) b{m) = am + Q{m), ?n,G[— 1,1], 

where a = b'{0) = 2/3 — 2 and j(5(?n,)| < Km"^ for m G [—1, 1], we can write 

MN{t) = a / MN{s)ds+ / Q{MN{s))ds+ / dZN{s), 
Jo Jo Jo 

We can now use variation of constants to write 

(3.2) M^it) = e"^* [ e-^'dZN + e°* / e~'^'Q(Mjv(s))ds, 

Jo Jo 

where the integral w.r.t. Zn is understood as Lebesgue-Stieltjes integral. 
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Lemma 3.1. Suppose there is a sequence of stopping times On satisfying 

(3.3) ^jv -^ oo, TV ^ oo, 
and 

(3.4) sup |M^(t)|<7V-T, 

t<SN 

for some ■y > and all N. Then, as N -^ oo, In = N^^"^ Jq'' e'^^'dZNis) 
converges in distribution to J\f{0,2a~^). 

Proof: Let us introduce a process 

(3.5) VN{t) = / e-'^'dZNis). 

Jo 

It is a martingale with 

[VN]t = / e-2-d[Ziv]„ 
Jo 

where square brackets denote the quadratic variation process. Let us define 

Un{s) = VAr(/(s)), where 

f[s) = , s£[0,2a ]. 

2a 

We need the following statement which is a specific case of Theorem 1.4 in 
[H Chapter 7] (see also bibliographical notes therein for the history of this 
theorem and related results): 

Theorem 3.1. For each N £ N, let Un be a martingale w.r.t. some fil- 
tration, with cadlag paths and Un{^) = 0. Suppose for all t € [0,20^-*^], 
^Nit) = [UnJi satisfies 

NANit) 4 t, N ^ oo, 
and 



(3.6) lim E 



iVV2 sup \UN{t) - UN{t-)\ 
0<i<2a-i 



0. 



Then, as N ^ oo, N^''^Un converges in distribution in the Skorokhod topol- 
ogy to the standard Wiener process on [0,2a^^]. 



In our case, condition (j3.6p is fulfilled automatically since all the jumps 
of Un are bounded by 2N^^ in absolute value. All the jumps of Zn are 
equal to 2A'"~^ in absolute value, so that 

[ZNm = ^BN{t), t>o, 
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where 


BN{t) 


denotes the number of jumps the 


process 


Zjsf makes 


up 


to 


time t. 


. Next, 




^^w = 4^ 


1^ e 

s:s<f{t)AeN 
Mjv(s)^Miv(s-) 


-2as 








and 


















(3.7) 






NAN{t)^- 


rfit) 
Jo 


= t. 









This is a consequence of the fohowing result: 

Lemma 3.2. For any non-increasing function g : M+ -^ M+ and any t > 0, 

1 V- , .P '■' 



y] 9{s) -^ I g{s)ds. 

TT.n Jo 



N — JO 

s:s<tAeN 
MN{s)^MMis-) 

Proof: It is sufficient to assume that g(s) = 1s(^[om foi' some h > since 
one can use hnear combinations of functions of this form to approximate 
any non-increasing function. We have to show that for all t, 

1 p 

— 5Af(t AhAeN)^tAh. 

Due to ()3.3p . we can restrict ourselves to the high probability event {t < 9^}- 
Let (sN^i) be the increasing sequence of times of jumps for each N, i.e., 
M]\[{sN,i) / Miy{siy^i—). Clearly, conditioned on Mj^{siy^i), the spacing 
random variable sat^j+i — s^^i has exponential distribution with parameter 

\+{MN{sN,i),N) + XN{MN{sN,i),N) = \{MN{sN,i),N), 

where 

A(m, N) = A_(m, N) + A+(m, N). 

Therefore, using (j3.4p we see that the number of points Sj not exceeding 
t A h is between two Poisson processes with intensities A(0, N) = N and 
A(A^^^, N) evaluated at time tAh. Our claim follows since A(A^^^, N)/N -^ 
1. n 

Now all the conditions of Theorem 13.11 have been verified and we con- 
clude that N^''^Un converges in distribution in Skorokhod topology to the 
Wiener process on [0,20""*^]. Therefore, In = N^''^UN{2a~^) converges in 
distribution to A^(0, 2a~^), and the proof of Lemma [3. II is complete. D 

Let us now check that the conditions of Lemma 13.11 hold for 9^ = inf {t : 
\MN{t)\ > N^^}. Notice that condition (j3.4p is satisfied automatically. 

Lemma 3.3. // 4 < 7 < i, then (j3.3p holds. 
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Proof: Let us take any T > 0. On {0]\i < T} 

N-^ < |Mjv(07v)| 



(3.8) 



oC^JV 



VN{eN) + 



On 



'Q{MN{s))ds 



<e'^^{\VN{eN)\ + a~^KN-^'^), 



where V has been introduced in ()3.5p . The quadratic variation of the mar- 
tingale Vjv was computed in the proof of Lemma 13. H and we can conclude 
that 

^[VN]e^ < 2a-'N-\ 

Therefore, by the Chebyshev inequality and the fact that V^ — [Vn] is a 
martingale (see Proposition 6.1 in [6, Chapter 2]), 



P{\VN{eM)\>N-^/^-'/^}< 



-lAT-l 



EV^iON) ^ E[VN]er, ^ 2a-iiV 



< 



< 



0. 



JV-7-1/2 - iV-7-1/2 - Ar-7-1/2 

Now, on {On <T}n {\Vn{On)\ < N-*/^-^/^}, we have 

which is impossible for large A^ under our assumptions. We conclude that 
P{tn < T} ^ 0, and the lemma follows. D 

We are ready to describe the asymptotics of the exit from [— A^^''', A^^''']. 
Lemma 3.4. // 1/4 < 7 < 1/2, then 



sgnMjv(gjv), On- ^^^ ^ InA^^ "''"*" 



sgnH, In \H 

a 



where Law{H) = Af{0,2a-^). 

Proof: Considering the process Mn at time 9n and using ()3.2p . we obtain 



9n = - In 
a 



IN-'' . f ] 



_2_ 

N 



VN{eN) + /o'^ e-^Q{MN{s))ds 



i/^^lniV-iln 
a a 



In + N^'^ I e 




'QiMNis))ds 



+ 0(1). 



Also, 



sgnM,v(e^) = sgn (In + N^'^ I e-'''Q{MN{s))ds ) . 

Since 

Afi/2 / e-"*Q(Mjv(s))(is < N~^^ -^ 0, 

Jo a 

the desired statement follows from Lemma |3. 11 D 
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Let US now study the exit of M^v from an interval [— r, r] where r is a 
smah number that does not depend on N. We define Yi\f{t) = Mi\f{t + 9n) 
and for any r G {0,R) we define 

z.7v(r) = inf{t > : |y^(t)| = r}. 

We are going to compare the evolution of the magnetization process to the 
deterministic trajectory of the flow 5"* generated by the drift b. 

For any 5 > we introduce t{S, r) as the only solution t of S^5 = r, i.e., it 
is the time it takes for the solution of ODE x = b{x) to travel from 5 to r. 

Lemma 3.5. For any r > 0, 

lim {ti5,r) - - In M = Kir), 

where K{-) was defined in (j2.4p . 

Proof: By the basic formula for solutions of autonomous ODE's (see, e.g., 
P Section 1.2]): 

t(6,r)= r^= r(^-^dx+r^ 

J 5 b{x) J 5 \bi^) ax) Js ax 



r 1 ax-h{x)^ 1,^ 
Jg b{x) ax a 



and the lemma follows. D 

Lemma 3.6. There is ro > such that 

sup \YNit) -S\Yn {0))\^0, iV^oo. 

0<t<t(|yjv{0)|,n)) 

Proof: Denote AAf(t) = YN{t) - S'*YAf(0). Since 

YN{t) = yjv(O) + / b{YN{s))ds + Z'j^it), 
Jo 

where Z'j^{t) = Z]\f{t + 6j\f) — Z]\f{6]y) is a martingale, and 

S'Yn{0) = Yn{0) + I b{S'YMm)ds, 
Jo 

we see that for any r € (0, m^,) and any t £ (0, t(|YAr(0)|,r)), 

^tAi/jv(r) 

\ANitAi^N{r))\<L{r) jAiv(s)|ds+ sup [^^^(s)!, 

Jo s<tAuN{r) 

where L{r) is the Lipschitz constant of b on [—r,r]. 

Since [Z'j^]t = 4^N~'^{Bn{9n + t) — Bn{9n))-, and the number of jumps 
between 9^ and 9n + t \s stochastically dominated by the increment of the 
Poisson process with intensity A^, we have 

P< sup \Z'^{s)\> N-^\ ^{) 
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if 6 < 1/2. On the complementary event, applying Gronwall's inequality, we 
obtain for some constant C > 0, 

(3.9) |A7v(t A Un{r))\ < e^ir)ti\YNm,r)j^-5 < ^^^L{r)/a^-6^ 

We can choose r to be so small that L{r) is close to a enough to ensure 
that 7L(r)/a < 1/2. Consequently, we can choose S < 1/2 such that p = 
6 — 7-L(r)/a > 0, and the r.h.s. of (|3.9p converges to 0. For any tq € (0,r) 
we conclude then that P{;/Ar(r) < t{\Y]y{0)\,ro)} — )• 0, and 

pii^7v(r)>t(|y^(0)|,ro); sup |A^(s)| > A^-^ ^ 0. 

[ s<t{\YMm,ro) J 

which completes the proof of the lemma. D 

We can now combine the results of Lemmas 13.41 and 
Lemma 3.7. For any r € (0, rg), 

1 \ ri-iatr ( 1 In r 

sgnMjv(riv(r)), rw(r)-— In iV '^" sgni^-, — In |/7| + ^ K(t) 

la ) \ a a 

Proof: Obviously, 

(3.10) TN{r)=eN + iyN{r). 
Lemma 13.61 implies that 

i/jv(r)-t(|y^(0)|,r) Ao. 
This together with Lemma 13.51 implies 

(3.11) ^^(^)_iin-^-i^(r)-^0. 

The lemma follows now from (J3.10p . (j3.1ip . and Lemma |3.4[ D 

The next result follows from the same considerations as Lemma r3.6l except 
that it is easier since we consider a finite time horizon. 

Lemma 3.8. Let r he as in the last lemma. Let YN{t) = Mi\i{TN{r) + t). 
Then, for any T > 0, 

sup \YN{t) - S\Yn{0))\ ^ 0, N^oo. 

0<t<T 

This lemma means that after T]\f{r) the process essentially follows the de- 
terministic trajectory. Since H = 2^'^o~^'^G, where G is standard Gauss- 
ian, our main result is a direct consequence of Lemmas 13.71 and 13.81 In fact, 
it extends Lemma 13.71 since the latter is valid only for sufficiently small val- 
ues of threshold, whereas our main result applies to any R € (0, m^). D 
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